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Abstract 

pH ■ The most commonly used relative abundance index in stock assess- 

■^T ' ments of longline fisheries is catch per unit effort (CPUE), here defined 

as the number of fish of the targeted species caught per hook and minute 
C^ ■ of soak time. Longline CPUE can be affected by interspecific competition 

^* and the retrieval of unbaited or empty hooks, and interannual variation in 

these can lead to biases in the apparent abundance trends in the CPUE. 
Interspecific competition on longlines has been previously studied but the 
return of empty hooks is ignored in all current treatments of longline 
CPUE. In this work we propose some different methods to build indices 
to address the interspecific competition that relates to empty hooks. We 
qq , show that in the absence of information about empty hooks, the relative 

f*^ ■ abundance estimates have constant biases with respect to fish density and 

this is typically not problematic for stock assessment. The simple CPUE 
index behaves poorly in every scenario. Understanding the reasons for 
empty hooks allows selection of the appropriate index. A scientific long- 
line survey is conducted every two years in the Strait of Georgia, British 
Columbia by Fisheries and Oceans Canada. The above methods are ap- 
plied to build the time-series of indices from 2003 to 2009 for quillback 
k>( \ rockfish (Sebastes maliger). Due to variation in the incidence of non-target 

j_j . species, the index trend obtained is moderately sensitive to the choice of 

C^ ' the estimator. 

Resume 

L'halieutique etudie les consequences des politiques de gestion des 
peches en modelisant l'abondance des populations d'interet. Des indices 
d'abondance sont utilises pour ajuster des modeles de dynamique de popu- 
lation. Pour les peches a la palangre l'indice d'abondance le plus couram- 
ment utilise est le nombre de prises par unite d'effort (CPUE), c'est-a-dire 
le nombre de poissons de l'espece etudiee divise par le nombre d'hamecons. 
La competition entre especes et les hamegons remontes sans appat peuvent 
influencer ce type d'indice d'abondance. L'influence de la competition 
entre especes sur les indices d'abondance de palangre a deja ete etudiee 



mais les hamegons remontes sans appat et sans prise sont systematiquemnt 
ignores. Dans cet article nous proposons differentes methodes pour coons- 
truire des indices d'abondance pour la palangre qui tiennent compte de 
la competition et des hamegons vides. Nous montrons que nieme en cas 
d'absence d'information sur l'origine des hamegons vides, les indices pro- 
poses ont un biais constant, independant du niveau de competition ce 
qui ne pose pas de probleme dans le cadre de la gestion de stock. Les 
CPUE conduisent a de mauvaises estimations quelque soit le niveau de 
competition et la proportion d'hamegons vides. Comprendre la cause des 
hamegons vides permet de selectionner l'indice le plus approprie. Une cam- 
pagne de suivi scientifique est organisee tous les deux ans dans le detriot 
de Georgia en Colombie Britanique par Peches et Oceans Canada. Les 
methodes presentees sont appliquees a ce cas detude pour construire des 
series d'abondance de 2003 a 2009 pour des Sebastes de Roches Seabstes 
maliger. 



1 Introduction 

Many fish stock assessments derive information about stock trends from anal- 
yses of longline catch and effort records (IPHC, ICCAT, IATTC references). 
The classical relative abundance index, Catch Per Unit Effort (CPUE), for 
populations monitored using longline gear, is defined as the average number 
of individuals of target species caught per hook and minute of soak time. 
This commonly used CPUE index of abundance ignores the variability in- 
troduced by the comp e tition for baited hooks within and between species. 



Somerton and Kikkawal (J1995I ) propose two different indices, based on instanta- 



neous rate of catch fro m longline survey records whi c h take s interspecific com 



petition into account. lHaimovic and Avila-da Silval (|2007l ) claim that CPUE 



and the instantaneous rate give the same results. Another important source of 
variability in the abundance indices which has received relatively little attention 
is the presence of empty hooks i.e., hooks returning without bait or fish. 

This paper generalizes the use of instantaneous rat es of catch from longline 
survey s as relative abundanc e indices (as proposed by ISomerton and Kikkawa 
(1993) and iRothschildl ( 19671 )) and evaluates alternative approaches to dealing 
with empty hooks in the formulation of longline survey stock trend indices. 

The paper is divided into three main sections after the introduction. Sec- 
tion [5] reviews existing methods for derivating relative abundance indices rom 
longline catch and effort records. Section [3] presents our generalization of the 
previous methods to account for empty hooks. Some simulation studies are con- 
ducted to compare the indices under different levels of interspecific competition 
and sources for empty hooks. Section 2] gives the results of the simulation stud- 
ies and illustrates the behaviour of the indices for monitoring the abundance of 
quillback rockfish (Sebastes maliger). The full technical details are outlined in 
the appendix. 



2 Review of methods to derive abundance in- 
dices from longline records 

2.1 Catch Per Unit Effort (CPUE) 

In a longline survey CPUE is defined as the ratio between the number of fish of 
the target species caught Nt and the number of hooks N times the soak time 

CPUE- * Tm ' get = ^L. 

# hooks x SoakTime N S 

This index ignores the effects of competition and gear saturation. 

2.2 The simple exponential model 

Somerton and Kikkawal (J1995I ) have proposed two alternative approaches to deal 



with the issues of hook competition and gear saturation. Only one is useful in 
our context because dealing with gear saturation requires observed capture times 
which are not recorded in the available dataset. The number of available baits 
on the longline is supposed to decrease at an exponential rate measuring the 
overall pressure on the hooks. This overall pressure may be split into a sum of 
relative abundance indices per species. Using this approach the k th catch C r ^ 
for a soak time S and for species r = 1, . . .R is given by: 



C r , k = ^N(l-e- xs )+e r , k , with e r , k *~" Af(0, a 2 ), (1) 

where N stands for the initial number of hooks, A is the overall pressure on 
hooks, and A = ^2 r A r is the sum of the specific abundance index per species, 
and k is the number of the set. The parameters (Ai, . . . , Xr) defined a relative 
abundance index for each t spccic4j. 

A few of the drawbacks of this Simple Exponential Model (SEM) are the 
assumptions of normality and homoscedasticity of the error terms. It is intuitive 
that the variability should be higher for species with higher relative abundance, 
i.e., the variance of the error term should not be constant but should depend in 
some way on X t . Furthermore N tk is a discrete number (number of fish caught), 
potentially small and the normal assumption is not accurate in this case. 

2.3 Multinomial Exponential Model 

This section reviews the underlying ideas of ISomerton and Kikkawal ( 19951) and 



proposes an alternative model which more closely mimics the behavior of the 



1 Mostly one particular species is the target species, let r = 1 while the other species, 
r > 1 are non-target species. In this context it is easier to consider only Ai = Ay the 
relative abundance for the target species and A2 = Ajvt the relative abundance index which 
summarizes all the other species. 



fish. This model has been originally proposed by iRothschildl (|1967| ) and de- 
scribes how the catch of a target species could be reduced by the catch of other 
species. 

Let us define Tt as the time it took to catch an individual from the tar- 
get species on one particular hook. Tt is assumed to follow an exponential 
distribution of rate At, i-c 

P(T T >u) =c- XtU . 

Tnt is an exponential random variable with parameter Xnt and models the 
time it took to catch an individual from any of the non-target species. We can 
define T — min {Tt,Tnt} as the time it took to catch an individual whatever 
the species it is. Thanks to the property of the exponential distribution, T is 
exponentially distributed with rate A = Xt + Xnt- This property justifies the 
decompo sition of the overall relative ab undance as a sum of specific abundance 
given by ISomerton and Kikkawa! ( 19951) . 



After the soak period S, for one hook, there are only three possible outcomes: 

• {/ = 0} = { The hook is still baited.}. It means that the time for a cap- 
ture is greater than the soak time. This event occurs with probability 

P(I = 0) = P(T> S) = e~ xs 

• {/ 7^ 0} = { The hook is no longer baited.}. 



\I ^ 0) = P(T < S) = 1 - e 



-AS 



Given the hook is no longer baited, there are two possible outcomes: the 
catch is either from the target species, i.e. {/ = T}, which occurs with 
probability 

P(7 = T) = P(T T < T NT \T < S)¥{T < S) = -f{l - e~ xs ), 

A 

or the catch is from a non-target species, corresponding to the event 
{/ = NT} which occurs with probability: 

P(J = NT) = P{T NT <T T \T< S)P{T <S) = ^(1 - e" AS ). 

A 

Assuming that all the hooks on a longline behave independently, the likeli- 
hood is given by 

L(X T , X NT ) ~{ Nb ){ Nt + Nnt )(e ) (I"* ) [~ ) 

(2) 
where 



• N is the number of hooks on the longline, 

• Nb is the number of baited hooks at the end of the soak time, 

• Nt is the number of individuals of the target species caught, 

• Nnt is the number of individuals of the non-target species caught. 

The combinatorial terms arise since all the hooks are considered independent 
and the order of the c atch on the longli ne has no importance. This model was 



originally proposed bv lRothschildl (J1967I ) although presented here with a slightly 



different approach. This model is called Multinomial Exponential Model (MEM) 
since the vector (Nb, Nt, Nnt) follows a multinomial distribution whose vector 
of probability depends on an exponential term. 

If Ant is larger than At, it corresponds to a high level of competition: for a 
given relative abundance of the target species At, the catch decreases as A^t, 
the non-target species relative abundance, increases. 

2.4 Links between the indices 

2.4.1 Links between MEM and SEM 

The expected number of fish caught of the target species Nt is the same under 
the MEM and SEM assumptions and is given by: 

E(N T )=N^(l-e- xs ). 
A 

Moreover, the models share the same parameters, At and Ant- The main 
difference is the error term. In the SEM, the error term is normally distributed 
with a variance given by: 

^ar SE M(N T ) = Yar{N NT ) = a 2 , 

while in the MEM the total number of fish caught has a multinomial distribution 
and the variance is given by: 



Var MEM (N T ) = N?f(l - e - AS ) (l - ^(1 - e - AS )) 
Var ME M(N NT ) = N^(l - e - xs ) (l - ^(1 - e - AS )) . 

Furthermore Nt and Nnt are assumed to be independent in the Simple Expo- 
nential Model and not in the Multinomial Exponential Model. 



2.4.2 Links between CPUE and MEM 

Under the MEM assumption, the expected CPUE of the target species is given 
by: 

E(aP^) = ^(l-e-^ o A T 

If the overall density index A is small enough, meaning that there is little com- 
petition, CPUE and the MEM index give the same results. This theoretical 
result is consistent with the expected behavior. 



3 Dealing with empty hooks 

In longline experiments, it is common for some hooks to return empty; the 
hook is no longer baited, but there is no fish on it. There could be several 
explanations for these empty hooks such as mechanical removal of bait during 
gear setting/retrieval, consumption of the bait by invertebrates or fish without 
being hooked or removal of the hooked fish by predators. 

In this paper, we consider the hypothesis that empty hooks arise only from 
the escape of fish. Therefore, the question about empty hooks is reduced to 
"How should the empty hooks be allocated to the different species?" These 
empty hooks provide information that we could use to improve the quality 
of our abundance indices. This section describes modifications of the MEM to 
incorporate this information and details some statistical properties of the indices 
built using these versions of MEM. It also describes different ways to include 
the empty hooks information into the SEM. 

3.1 Full version of the Multinomial Exponential Model 

We propose a modified version of the Multinomial Exponential Model to ac- 
count for empty hooks. As opposed to the previous version of the MEM, here 
each fish caught has a probability of escaping equal to pr, for target species, 
and pnt, for non-target species. 

We use three additional variables to fully specify the model: Ne is the 
number of observed empty hooks; N E (respectively N E ) stands for the 
number of target species (respectively non-target species) individuals which have 
escaped and these two random variables arc not observed. Assuming, as for 
the simple version of MEM, that all hooks are independent, we are able to 
conditionally describe the outcomes (Figure [TJ. 

• The number Nb of baited hooks retrieved at the end of the soak time is 
the realisation of a binomial random variable with probability of success 



e" AS . 



N B ~ B (N, e 



-\s\ 



• Among the N — N B empty hooks, the total number of individuals from 

(T) 

target species caught is Nt + N E and is also binomially distributed: 
N T + N ( P \N b ~b(n~N b ,^- 

(T) 

• Given Nt + N E , the total number of individuals from target species on 
the longlines is Nt and is also binomially distributed: 

iV T |iV T + NJP ~b(Nt + Ng\ (1 - p T j) ■ 



Given N, 



NT 



N 



(NT) 



the total number of individuals from non-target 



species on the longlines is Nnt and also has a binomial distribution: 
N NT \N NT + N { E NT) ~ B [n nt + N E NT) , (1 - PNT )) ■ 




NT N C§> 

N +N 
- ^& r 




Figure 1: Conditional description of the model. The observed quantities are 
solid lines, the hidden quantities are dashed lines. 

The full version of Multinomial Exponential Model is summed up through 
a probability tree in Figure Q] N E ' and N E ' are missing quantities but the 
sum Ne of these two quantities is observed. Appendix IB . 1 1 gives the main step 
to define the likelihood of this model: 
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The full version of Multinomial Exponential Model may be considered as a 
multinomial distribution: 



N E 



(N b ,N t , N nt , N e ) ~ M (N, a) 



with a 
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XtPt + ^ntPnt 
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In this full version the model is not identifiable since an equivalent version 
could be expressed with only three parameters A, At(1 — Pt) and Xnt(^ ~Pnt) 
(see annex HOI for more details). Some additional information is required to 
estimate the parameters in this model. It is possible to add some biological 
knowledge on the probability of escape through prior distribution in a Bayesian 
framework but no information of this kind is available for our case study. This 
point is discussed in section [5] In a frequcntist approach some particular solu- 
tions have to be chosen. In this paper we focus on two reasonable choices: 

1. MEM1: empty hooks come only from non-target species, so pt is assumed 
to be 0. Most of the time the target species is less abundant than all the 
non-target species. Allocating the empty hooks to the non-target species 
will at worst lead to an underestimation of the target species. Furthermore 
baited hooks arc designed to catch and retain the target species. 

2. MEM2: another reasonable choice is to assume that the probability of 
escape is the same for target and non-target species, i.e pr = Pnt- An 
empty hook has had the bait stolen by a fish but no information about 
the species of this fish is available so that the empty hooks are allocated 
according to the relative densities of each group. 

3.2 Maximum likelihood estimation of MEM 

As all the longline sets are supposed to be independent, the complete likelihood 
is simply the product of the likelihood for each experiment given by formula [3] 
At this stage we have to consider two different situations: variable soak times 
or similar soak times. 

If the soak times are different for all the longline sets, no analytical formula 
for the estimators is available, the estimation step has to be performed using a 
non linear optimization algorithm. 

Mostly, the longline experiments have been designed to share the same soak 
time to reduce the causes of variation in the experiment. In this case, analytical 
formula can be derived for MEM1 and MEM2 because all the information can 
be summed up through the vector (Nb+, Nt+, iVjvT+, Ne+) which corresponds 

t0 (J2i=i N Bi , J2i=i N Ti , J2i=i n nt l , J2i=i N Ei), where I is the number of the 
longline set. 

Even if the design of the experiment prescribes a constant soak time for 
each set, the actual soak time can differ slightly due to weather conditions or 
practical reasons. If the difference is not important, it is judicious to consider 
a single soak time (the mean for instance) to avoid the need of a numerical 
optimization which produces some instability in the estimations. 

As detailed in appendix IB. 31 the maximum likelihood estimators are given 
by: 
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(4) 

Maximum likelihood estimators are asymptotically unbia sed and the cov ari- 
ance matrix is the inverse of the Fisher Information Matrix ( Severinil . 120001 ) . If 
the total number of hooks N is large enough, the joint distributions of these 
estimators can be approximated by a Multivariate Normal distribution. Asymp- 
totically the covariance matrix for MEM1 is given by: 
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The result proposed by iRothschildl (J1967I) concerning the asymptotic vari- 
ance for the simple version of MEM with px = Pnt = should be the same 
replacing p or pnt by in the above formula. Nevertheless, the two formulas 
are not compatible even if the estima tors arc. We suspect a mistake in the 
formula proposed by IRothschildl (J1967I) . 



3.2.1 Bayesian framework for MEM 

The multinomial exponential model could also be estimated in a Bayesian frame- 
work. The full specification of the Bayesian version of the MEM, requires the 
definition of some prior distributions for the parameters (\t,^nt,Pt,Pnt)- 
The relative abundance index is always less than 1. Therefore, in our study, the 
priors have been chosen as poorly informative and independent. 

X T ~ 0(1,1) Ajvt ~ 0(1, 1). 

If there is no informative prior on the probability of escape, the model is still non- 
identifiable. In a Bayesian framework, an identifiable model can be diagnosed 
since the posterior distribution is the same as the prior distribution. Nothing 
has been learned from the data. 

To avoid this problem, informative prior distributions could be defined using 
biological knowledge or field experiment. This aspect hasn't been investigated 



in this work. We use the specific forms of the model (MEM1 or MEM2) to 
remove the problem of identifiability. 

The es t imatio n procedure has been implemented using the JAGS software 



( Plummeii 120081 ). An example of JAGS code is provided in section fC.ll for 



MEM1 and IC. 21 for MEM2 and all other codes are available on request. 

3.3 Estimation for other indices 

This section will discuss the estimation steps of all the indices previously pre- 
sented and highlights the key points of the procedure. 

3.3.1 Multiple CPUE 

In a given region, you could have several sets of longlines deployed, so the 
estimation step requires the fitting of one index using several observations. 

If only one set is deployed, the CPUE is obtained by the catch over the soak 
time multiplied by the number of hooks. A simple generalization of this index 
is detailed in Appendix [3] and proposes to compute CPUE as 

cpue-^ Nti 



Et SiN t ' 

where I = 1,...,L stands for the longline set. If the number of hooks or the 
soak times are different, the generalized CPUE computed is a weighted average 
of all individual CPUE. A simple average is not sufficient, since for example, the 
average will attribute the same weight to an experiment with 200 hooks and to 
another experiment with only 50 hooks. 

3.3.2 Simple Exponential model 

In the following, two versions of SEM are derived depending on how empty 
hooks are considered: 

1. SEMI: empty hooks are assumed to arise only from the non-target species 
and Ne and Nnt are pooled together. 

2. SEM2: empty hooks are considered as a "third" species, and an additional 
relative abundance index is defined A^. 



Hovgard and Lassenl (2000J) proposes to estimate A using 
- - log (N B /N) 

*Hov = ^ ■ 

If all longline sets share the same soak time and the same initial number of 
hooks N, the MLE for this model are almost the same than for the MEM with 
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the exception of empty hooks. 
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To estimate cr at least two longline sets are required: otherwise the estimates 
correspond to a perfect match and there is no additional variability. 
We are only interested in building abundance indices for the target species and 
since the estimations of At in SEMI and SEM2 are the same, in the following 
we call this index SEM. 

When the soak times or the initial number of hooks are different, a numer- 
ical optimisation algorithm has to be used to defined the MLE for SEMI and 
SEM2. This approach should be avoided if possible due to numerical insta- 
bility. From a practical point of v iew the nlm function available in R software 
( R Development Core Teaml 120091 ) behaves badly. In this work we directly op- 
timize the log likelihood function using function optim. 

The analytical formulas for At are exactly the same for SEM and MEM1: if 
soak times and the initial number of hooks are the same for all sets, there is 
absolutely no difference between these two indices concerning the relative abun- 
dance of the target species, which is not true for the non-target species. But 
this analytical formula is only valid for SEM when soak times and the initial 
number of hooks are the same while MEM only requires the same soak times. 



3.4 Simulation studies 

Bias and variability of the estimators are evaluated in this section through 
simulation studies. Several plausible scenarios have been studied to give some 
robust results and advice about the behaviour of all the indices. 

Specifying data generators is the first step of simulation studies. In our spe- 
cific case, we have at first glance, different solutions. We can use the Simple 
Exponential Model but this model doesn't simulate any empty hooks, and it pro- 
vides some non-integer catch because of the normal hypothesis. The two other 
solutions (ie MEM1 and MEM2) will be used to study the different scenarios. 
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This choice of data simulator gives obviously an advantage to MEM1 if the 
data were simulated with MEM1 and to MEM2 in the other case. 

For one set of fixed parameters (that is At, ^nt, L the total number of sets, 
S the soak time, N the number of hooks on a longline) 5000 fake datasets are 
generated and the corresponding estimated values for At and A^t have been 
computed. A relative bias and a coefficient of variation are derived from these 
simulations. To study the impact of the estimation via a numerical algorithm, 
we also compute the estimators by maximizing the log likelihood using a non 
linear optimization algorithm. 

The values At and Ajvt need to be chosen to reflect a plausible situation. In 
this work, four values of each parameter have been used and all of the sixteen 
combinations of these two parameters have been addressed. 

Three different scenarios for empty hooks have been simulated: 

Sc.l) There are no empty hooks. Each hook has caught a fish. This situation 
corresponds to p^r = Pt = 0. 

Sc.2) The ability to escape is the same across species. The probability of escape 
is set to 20%, this corresponds to pr = Pnt = 0.2. 

Sc.3) non-target species are better at escaping. The probability of escape is set 
to pnt = 0.2 for the non-target species and to pt = 0.02 for the target 
species. 

The results of this simulation study are presented in section 14.11 

3.5 Description of the B.C. inshore rockfish longline sur- 
vey 

Since 2003, Fisheries and Oceans Canada has conducted an annual research 
longline survey in the Strait of Georgia with the fisheries research vessel CCGS 
Neocaligus. Different regions of the Strait are covered each year, resulting in 
each statistical area (PMFA) being surveyed every two to three years. A 2 km 
by 2 km grid is overlaid on all inshore rockfish habitat up to 100 m in depth, 
as determined using Canadian Hydrographic Service charts. These blocks are 
stratified by depth into shallow (41-70 m) and deep (71-100m) and 8% of the 
blocks, in a given statistical area, are randomly selected for fishing each year 
(see Lochead and Yamanaka 2007 for further details). 

The snap- type longline gear consists of 1800 ft of leaded groundline with 225 
circle hooks (13/0) spaced 8 ft apart. Each hook is attached to the snap by a 
1.2 ft perlon gangion, crimped at both ends, and with a swivel at the hook. The 
hooks are baited with Argentinean squid. Soak time for each set is two hours 
and measured as the time from deployment of the last anchor when setting the 
gear to the retrieval of the first anchor on board when hauling. 

As the gear is retrieved, the condition of each hook is recorded as returning 
with bait, with catch, empty (i.e. without bait or catch on the hook), or un- 
known, if the hook does not return. Catch is recorded to the species level for 
both fish and invertebrates. 
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4 Results 

4.1 Simulation studies 

4.1.1 Competition but no empty hooks 

Figure [2] presents the bias for the two indices in the absence of empty hooks. 
SEM, MEM1, MEM2 produce exactly the same results because there are no 
empty hooks and the analytical formula has been used. 
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Figure 2: Relative bias, defined as the absolute value of the bias divided by 
the true value |At — At|/At, expressed as a percentage computed over 5000 
simulations for 220 hooks per longline and 20 sets. Bias for SEMI, SEM2, 
MEM1 and MEM2 is the same in this case. On the right the bias computed 
for CPUE indices show that the bias increases with the increase of relative 
abundance of the non-target species. 



The estimations are unbiased for abundance indices built with the exponen- 
tial model and so competition is effectively taken into account. On the other 
hand bias in the CPUE index increases with increasing relative density of the 
non-target species. This result confirms that the CPUE index strongly depends 
on competition and should be avoided. This behavior is always the same in all 
situations which have been addressed in this simulation study 

The coefficient of variation presented in Table Q] depends on the number of 
data relying on N and L but also on the relative abundance, parameters At and 
Ajvt- This coefficient could be very high for low relative abundance situations. 

The coefficient of variation of the estimators for the relative abundance de- 
creases with the expected number caught, which is in this study, L N^j- (l — e~ AS ) . 
The relation between these two quantities is illustrated by Figure El This ex- 
pected number caught describes the actual available information on Xt- 

The bias study shows that CPUE behaves poorly when interspecific com- 
petition occurs. In the following the results concerning this index will not be 
shown. 
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Table 1: Coefficient of variation (%) in estimates of At using the MEM models. 
Results For SEM, MEM1 and MEM2 are exactly the same since the soak time 
is shared by all longline sets and there is no empty hook. 
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Figure 3: Coefficient of variation for the estimates of At for MEM1 computed 
over 5000 simulations as a function of the expected number of catch for target 
species 
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4.1.2 Competition and empty hooks 

SEM and MEM have the same behaviour when no empty hooks are present in 
the dataset. Simulations of empty hooks allow the comparison of the respective 
behavior of those indices in presence of empty hooks. In our simulation context, 
the soak time and the initial number of hooks are constant in all sets, so that 
an analytical formula can be used to compute the indices and SEM and MEM1 
produce the same results. 

Since MEM1 and MEM2 relics on two different hypothesis concerning the 
origin of the empty hooks, they give some different results. When all species 
are equally good at escaping (Sc.2) MEM2 produces unbiased estimators of the 
relative abundance while SEMI, SEM2 and MEM1 tend to underestimate this 
relative abundance. In the simulations used to produce Figure|4l the probability 
of escape for non-target and target fish has been set to 20%, which corresponds 
to the underestimation of 20% for the relative abundance At. 
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Figure 4: Simulations under Scii. MEM1, SEMI and SEM2 produce the same 
results and tend to underestimate the relative abundance. MEM2 is the "true" 
model in this situation and tends to produce unbiased estimate. 



When the non-target species are better at escaping, MEM2 tends to overes- 
timate the relative abundance of the target species. Results presented in Figure 
[5] are produced when px = 0.02 and p^r = 0.2. MEM2 attributes a proportion 
of the empty hooks to the target species, this proportion depends on At and 
Ajvt- Since in this simulation most of the empty hooks arise from the non-target 
species, the higher the relative abundance of non-target species is, the more the 
relative abundance of target species is overestimated. The bias in the estimates 
for MEM1 is constant and equals 2% which correspondsto the missed fish from 
the empty hooks. 
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Figure 5: Simulations under Scl: MEM1 and SEM underestimate the true 
relative abundance by exactly the probability of escape for non-target species. 
MEM2 highly overestimates the relative abundance. 



4.1.3 Numerical instability 

In order to study the numerical instability of the optimization algorithm, the 
maximum likelihood estimators have been computed through the analytical for- 
mula and using a numerical optimisation algorithm on the same data set (with 
shared S and shared N). Which ever scenario is used, the optimisation algo- 
rithm behaves well, i.e, less than 5% of difference between the estimates com- 
puted using the analytical formula and the estimates obtained by numerical op- 
timization except when the ratio between the relative abundance of non-target 
species Xnt and the relative abundance of the target species At is very high. 
In the extreme case, with Ajvt = 0.01 and At = le — 05 the average difference 
between the two estimates varies from 10% to 40%. This poor behaviour oc- 
curs when the log-likelihood peak is not strong enough. Some examples of this 
behaviour concerning the optimisation step are illustrated on the real data in 
section l4~2l 



4.2 Rockfish survey results 

Figure E] shows the different relative abundance indices obtained using the sci- 
entific survey described in section GT5] The confidence intervals have been com- 
puted using a bootstrap procedure with 5000 resamplcs. 



The estimate for the numerical version of SEM index exhibits a considerable 
difference in trends and the confidence interval associated with this estimate 
is huge. This is due to a numerical instability problem. Indeed, there are 
very few quillback caught in 2007 in area 13 as suggested by the decrease in 
the trend for all other indices. Therefore, the numerical optimization procedure 
behaves poorly which produces a poor estimate but also a large variability using 
a bootstrap procedure. 

Within the Strait of Georgia survey, the northern areas (12 and 13) have been 
surveyed together in the same years (2003, 2004 and 2007) and have alternated 
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Figure 6: Four indices computed for area 13 for the quillback population. The 
numerical optimisation has some stability issues for year 2007, the results should 
be the same then MEM1 estimates. 

with surveys in the southern areas (14-20). Since management isn't applied at 
the area scale, data from area 12 and 13 have been pooled to form one dataset. 
The corresponding relative abundance time series is shown in Figure [7] 



Pooling the data from both areas produce more precise estimates as shown in 
Table [2] and solves the numerical instability for the SEM index. The coefficient 
of variation is divided by almost two when using the whole dataset. We didn't 
include the other areas of the Strait of Georgia (14-20) in the study since they 
were only surveyed once in 2005. 

Given the uncertainty on the relative abundance indices, no change is statis- 
tically significant in the relative abundance of quillback population in the strait 
of Georgia. 

5 Discussion 

The first and most important conclusion of this study is that when interspecific 
competition was accounted for in simulated data, the classical longline CPUE 
estimators gave strongly biased estimates of stock trends in all cases evaluated. 
In contrast, the exponential model-based estimators showed much less bias. 
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Figure 7: Four indices computed for area 12 and 13. 



Therefore it is advisable to avoid the use of classical CPUE as much as possi- 
ble since this index doesn't take competition into account. From one year to 
another, the level of competition can vary and two CPUE indices computed dur- 
ing two different years are not comparable which is unacceptable for a relative 
abundance index. 

In the absence of empty hooks, the abundance indices built on SEM or MEM 
are comparable even if the fundamental assumptions of SEM and MEM are very 
different. If Nt is considered as a sum over all hooks of the number of hooks 
which have caught a target individual, the central limit theorem claims that if 
the number of hooks is large enough Nt exhibits a normal distribution. The 
inde pendence o f hook s is not required, only some weak dependence is necessary 



independence or nooks is not required, only some weak dependence is necessary 
(see iBillingslevi ( 19951 ) for central limit theorem under weak dependence condi- 
tions). SEM doesn't assume the independence on hooks but Nt and Nnt are 
considered as independent variables which is obviously false especially when this 
model is used to take competition into account. This explains how the number 
of non-target species caught Nnt could decrease the number of target species 
caught Nt- In contrast MEM models the dependence between Nt and iVjvr 
but assumes the independence of hooks. 

Ignoring empty hooks will produce poor indices because most of the empty 
hooks result from fish escapement which should be counted for in the relative 
abundance index. Empty hooks are of major importance to building abundance 
indices even if there is no perfect solution to deal with them. Biological knowl- 
edge can be very useful for deciding how to deal with empty hooks and the 
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Coefficient of variation (%) 


Index 


Area 13 


Area 12 and 13 




2003 
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2007 


2003 


2004 


2007 


MEM1 


24.1 


26.6 


22.7 


15.9 


12.6 


13.6 


MEM2 


23.5 


29 


23.1 


15.6 


13.6 


13.3 


SEMN 


25.8 


30.2 


29.8 


15.6 


13.1 


13.9 


CPUE 


26.4 


28.1 


24.7 


16 


13.4 


14.1 



Table 2: Coefficient of variation of the different relative abundance indices 
expressed in percent. The variability strongly decreases when data are pooled 
together. The large value for the coefficient of variation for 2007 for the numer- 
ical version of SEM is due to optimization instability. 



Bayesian framework offers an intuitive way to use this kind of information to 
remove the non identifiability problem. But this biological knowledge is hard to 
obtain since the escapement of fish from hooks is difficult to study. MEM1 and 
MEM2 require fully explicit choices concerning the empty hooks. Even if this 
required choice is hard to make, the explicit choice should still be considered. 
In the SEM model the choice is made by default and it is even not explicit. 
We recommend that practitioners design studies to collect information about 
the ability of target and common non-target species to escape and work in a 
Bayesian framework using the code provided in Annex [C] The major advantage 
is that the uncertainty about empty hooks is translated to uncertainty about 
the relative abundance indices. If absolutely no information is available, the 
recommendation is to use SEMI since the bias on the relative abundance index 
doesn't depend on the abundance of non-target species. Furthermore, multi- 
plicative constant bias as seen for MEM1 doesn't produce biased estimates of 
population dynamic parameters model since the multiplicative constant is ab- 
sorbed by the coefficient of proportionnality which links abundance indices and 
biomass. 

One major drawback of all the models formulated in this paper is the as- 
sumption of constant relative abundance along the longline set and in d epend ence 
between hooks which are obviously not true. For instance, ISiglerl ( 20001 ) has 
shown that at least the catch rate decrease with time for sablefish (Anoplopoma 
fimbria). Different approaches could be explored to avoid this assumption. The 
first one would be to record the change of habitat along the longline set and 
using this as a covariate in the model. Another possibility would be to refine 
the modeling of the abundance index. For instance, we could consider a local 
relative abundance index At/i at hook h defined as the sum of the main rela- 
tive abundance At plus a noise term. The noise term would be chosen as an 
autoregressive model for example to use the information of the hooks in the 
neighbourhood. This extra variability could account for the variability in the 
habitat. Another perspective for dealing with the variability along the longline is 
to define the abundance index as piecewise constant function along the longline 
and trying to detect the change in this function using the tools of change-point 
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detection ( Lavielle and Lebarbierl 120011 ). 



The possible variation of At during the soak time is also a question of inter- 
est. Some species could be more attracted by a fresh bait and therefore A would 
be supposed to dec rease with time. This q uestion of attractivity of the baits 
has been studied bv lFerno and Olsenl ( 1995 ) in chapter 8 but currently there is 



no solution to take this into account when building some abundance indices. 
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A CPUE computed from several observations 

A CPUE index may be considered as the estimation of the probability of suc- 
cesses in a binomial trial. If Nt stands for the number of success and p is the 
probability of success over N x S trials, then 

N T ~B(j>,N x S). 

The maximum likelihood estimator p is given by Nt/(N x S) =CPUE. If 
independent records Nt, for L longlinc sets arc available with distribution 
Nt, ~ B(p,Ni X Si), a sufficient statistic is the sum of all the catch JVjT which 
is distributed as 

Y^Nr^N+^Bip^NixSt). 
i i 

The maximum likelihood estimator is defined as 

Eli N Tl 

Ei SiNi ' 

which defines a generalized CPUE definition. 



B More details about the MEM 

B.l Likelihood of MEM with empty hooks 

This section gives the key steps of the calculus for the likelihood given in formula 
[3) We use brackets to denote pdf's as many conditioning terms will appear 
in the probabilistic expr essions derived from the f ull version of Multinomial 
Exponential Model. As in lGelfand and Smith! ( 19901 ). the brackets denote either 



a density or a discrete probability distribution. 
The likelihood is then defined by: 



1(\tAnt,Pt,Pnt) = [Nb,N t ,N nt ,N e \\t,^nt,Pt,Pnt}, 

= [Nb\^t, Aatt] [Nt,Nnt,Ne\Nb,\t,^nt,Pt,Pnt] ■ 

By definition of the model [Nb\Xt, Xnt] is a binomial distribution. We 
need then to define the joint distribution of (Nt, Njvt, Ne) given Nb the total 
number of unbaited hooks. To obtain this distribution, we need to explicit the 
integration term over the hidden quatities N T ' and N^t ■ To make the writing 
easier to follow all the parameters will be omitted in the conditionning term. 
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The likelihood is then obtained by combining the binomial distribution of 
N B with this previous result to give equation [3] 

B.2 Identifiability 



To prove that the full version of the MEM is not identifiable it is sufficient to 
express the likelihood with only three parameters. Let us define a = —f-{l — p<r) 
and 8 — ^^(l — Pnt)- Therefore the likelihood given in equation [3] may be 
rewritten as: 

^ *, n - Nb > Nt ™ nt , Ne , ^ xs f B (i - e- xs f- NB («r wr- a - « - Pf° 

(7) 
This form of the model is identifiable and it is called the regular form of the 
model. 

B.3 Estimators 

B.3.1 MEM Estimators 

Using the regular form of the model given in equation it is easy to derive the 
log likelihood and obtained the maximum likelihood estimators: 

A=--l (— \ &- — 3- Ntn 

S 0S \N B )' a Nt + N nt + N e ' P ' N t + N nt + N e ' 

Those estimators are quite intuitive: for example a parameter represent the 
relative density of catchablc individuals from target species and it is naturally 
estimated as the ratio between the the catch of target species and the total 
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number of catch including empty hooks. The ambiguity of the model lies then 
in the definition of the relative density, how much the catchable density has to 
be increased? 

The maximum of the logllikelihood function equals 



log 



AH 



log 



N% B N* T N^ T N^ T N% E 



^JVbUVt! N T l Nnt\N ne J ™°1 N n 

This value could be used to compute an AIC criteria. 



In the following we will derive the maximum likelihood estimators in our two 
specials cases of interest. 



B.3.2 MEM1 Estimators 

MEM1 corresponds to the assumption that the target species can't escape, so 
taht pt = 0, in this context the log likelihood is given by 

L (A T , X nt ,Pnt) =K - N B SX + (N - N B ) log (l - e~ AS ) + N T log (A T ) 

+ {N NT + N E ) log (Ajvt) + N NT log (1 - p NT ) - (AT - N B ) log (A) + N E log (p NT ) 

The first derivatives are then derived: 
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The maximum likelihood estimators given on the left side of equation @] are 
then obtained by determining the roots of this set of equations. 

If N the number of hooks is large enough, the distribution of the estimators 
may be approximated by a normal distribution with mean (At, Xnt^Pnt) and a 
matrix variance which is the inverse of the Fisher information matrix F defined 
by 
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where 9 represents a generic vector of parameters. 

It can be shown that the asymptotic variance matrix for MEM1 is given by: 
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B.3.3 MEM2 Estimators 

The equivalent equations for MEM2 (corresponding to the assumption that the 
probability of esacpe is the same for every species)with p? = Pnt = P can be 
obtained from the following LogLikclihood: 

L (At, \nt,p) =K - N B SX + (N - N B ) log (l - e~ AS ) + N T log (At) 

+ N NT log (Ajvt) + (iVjvr + N T ) log (1 - p) - (N T + N NT ) log (A) + N E log (p) 

The first derivatives are then derived: 

dL at (N-N B )Se- xs N T N T + N NT 
= — 1\bo H : — r 



d\ T l-e~ xs At A 

dL Ar (N - N B )Se- xs N T + N NT N - N B 



OXnt 1 — e as Xnt A 

dL _ N T + N NT N E 
dp 1 — p p 



The maximum likelihood estimators are obtained by determining the roots 
of this set of equations. 

The asymptotic covariance matrix is then given by: 
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C Some useful codes 
C.l JAGS code for MEM1 

var 

# data 

# Data is a matrix with 4 columns and NData lines 

# Col 1 corresponds to Nb, Col2 to Nl, Col3 to N2, col4 to Ne 
NData, P [NData], Data [NData, 4] , N [NData], 

#variable requiring initialisation 

lambdal, lambda2, p, 

#variable without initialisation, deduced from the code 

alpha [NData, 4] ; 

model { 

/* prior density */ 

p " dbeta(0. 1,0.1) ; 
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lambdal " dbeta(0 . 1 ,0 . 1) ; 
lambda2 ~ dbeta(0 . 1 ,0 . 1) ; 
lambda <- lambdal + lambda2; 

/* Model part */ 

for(j in l:NData) 

{ 

alpha [ j , 1] <- exp(-lambda * P[j]); 

alpha [j, 2] <- (1- exp(-lambda * P[j])) * lambdal / lambda; 

alpha [j, 3] <- (1- exp(-lambda * P[j])) *lambda2/lambda *(l-p); 

alpha [j, 4] <- (1- exp(-lambda * P[j])) * lambda2 * p /lambda ; 

Data[j,] ~ dmulti (alpha [j ,] , N[j]); 

} 

> 

C.2 JAGS code for MEM2 

var 

# data 

# Data is a matrix with 4 columns and NData lines 

# Col 1 corresponds to Nb, Col2 to Ml, Col3 to N2, col4 to Ne 
NData, P [NData], Data [NData, 4] , N [NData], 

# variable requiring initialisation 
lambdal, lambda2, p, 

# variable without initialisation, deduced from the code 
alpha [NData, 4] ; 

model { 

/* prior density */ 

p " dbeta(0. 1,0.1) ; 

lambdal ~ dbeta(0 . 1 ,0 . 1) ; 

lambda2 ~ dbeta(0 . 1 ,0 . 1) ; 

lambda <- lambdal + lambda2; 

/* Model part */ 

for(j in 1: NData) 

{ 

alpha[j,l] <- exp(-lambda * P[j]); 

alpha [j, 2] <- (1- exp (-lambda * P[j])) * lambdal * (1-p) / lambda; 

alpha [j, 3] <- (1- exp (-lambda * P[j])) *lambda2/lambda *(l-p); 

alpha [j, 4] <- (1- exp (-lambda * P[j])) * p ; 

Data[j,] " dmulti (alpha [j ,] , N[j]); 

} 

> 



25 



